Schemaless Representation of Semistructured Data and Schema Construction

Authors

  • Dong-Yal Seo
  • Dong-Ha Lee
  • Kang-Sik Moon
  • Jisook Chang
  • Jeon-Young Lee
  • Chang-You Han

Abstract

In a networked information world, we must deal with semistructured data, which carries only weak schema information. To manage such data efficiently, this paper introduces a data model for semistructured data together with operations for schema construction. In contrast to earlier studies, which depend entirely on schemaless manipulation, we transform semistructured data into structured data through a schema construction methodology. For schema construction, we define operations for building IS-A and IS-PART-OF relationships, collecting data objects to build a primitive class, and merging two data instances or classes.
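
The abstract does not say how these operations are defined. The following is only an illustrative sketch under assumptions that are not taken from the paper: objects are flat attribute dictionaries, a primitive class is identified by a set of attribute names, IS-A is approximated by strict attribute-set inclusion, and merging two classes takes the union of their attributes. The function names are hypothetical, and IS-PART-OF is left out.

```python
from collections import defaultdict

# Assumption (not from the paper): a class is a frozenset of attribute names.


def collect_primitive_classes(objects):
    """Group objects that share exactly the same attribute names."""
    classes = defaultdict(list)
    for obj in objects:
        classes[frozenset(obj)].append(obj)
    return dict(classes)


def is_a_pairs(classes):
    """Return (sub, sup) pairs where sub has all attributes of sup and more."""
    return {(sub, sup) for sub in classes for sup in classes if sup < sub}


def merge_classes(a, b):
    """Merge two classes into one carrying the union of their attributes."""
    return a | b


# Hypothetical sample data for illustration only.
objects = [
    {"name": "Ann"},
    {"name": "Bob", "email": "bob@example.org"},
    {"name": "Cy", "email": "cy@example.org"},
]

classes = collect_primitive_classes(objects)
person = frozenset({"name"})
contact = frozenset({"name", "email"})
relations = is_a_pairs(classes)
```

Here the two objects with both `name` and `email` fall into one class, and that class is reported as IS-A related to the class with only `name`. A real schema construction method would likely need to handle nested objects and near-matching attribute sets, which this sketch ignores.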

Similar resources

Schemaless Semistructured Data Revisited: Reinventing Peter Buneman's Deterministic Semistructured Data Model

This paper reviews the design of data models for semistructured data, particularly focusing on their schemaless nature. Uniform treatment of schema information and data, in other words, uniform treatment of metadata and data, is important in the design of such data models. This paper discusses what data and metadata are, and argues that attribute names, which are usually regarded as metadata, a...

Schema Extraction for Semi-Structured Data

The emerging field of semistructured data leads to new ways of representing data as schemaless or self-describing. However, in many applications data often has some regularity, and ignoring this possibly partial structure hinders the ability to interpret the data and to access it efficiently. In this paper we investigate a knowledge-based approach for discovering partial implicit structures from se...

PathLog: a Query Language for Schemaless Databases of Partially Labeled Objects

In the paper we deal with the problem of modeling and querying information in schemaless databases of partially labeled objects (PLO-DB). Partially labeled objects are used for modeling data within repositories integrating both structured and semistructured data. The proposed PLO (Partially Labeled Objects) data model originates from the OEM data model and extends it by allowing partial labelin...

NF-SS: A Normal Form for Semistructured Schema

Semistructured data is becoming increasingly important for web applications with the development of XML and related technologies. Designing a “good” semistructured database is crucial to prevent data redundancy, inconsistency and undesirable updating anomalies. However, unlike relational databases, there is no normalization theory to facilitate the design of good semistructured databases. In th...

Schema Profiling of Document Stores

In document stores, schema is a soft concept and the documents in a collection can have different schemata; this gives designers and implementers augmented flexibility but requires an extra effort to understand the rules that drove the use of alternative schemata when heterogeneous documents are to be analyzed or integrated. In this paper we outline a technique, called schema profiling, to expl...

Publication date: 1997